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Abstract 

In this paper, we discuss integer-valued autoregressive time series (INAR), Hawkes point processes, and their interre¬ 
lationship. Besides presenting structural analogies, we derive a convergence theorem. More specifically, we generalize 
the well-known INAR(p), p € N, time series model to a corresponding model of infinite order: the INAR(oo) model. 
We establish existence, uniqueness, finiteness of moments, and give formulas for the autocovariance function as well 
as for the joint moment-generating function. Furthermore, we derive an AR(oo), an MA(cx)), and a branching-process 
representation for the model. We compare Hawkes process properties with their INAR(oo) counterparts. Given a 
Hawkes process N, in the main theorem of the paper we construct an INAR(oo)-based family of point processes and 
prove its convergence to N. This connection between INAR and Hawkes models will be relevant in applications. 
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Introduction 


In this paper, we show that Hawkes point processes are continuous-time versions of integer-valued autoregressive 
time series and—vice versa—that integer-valued autoregressive time series are discrete-time versions of Hawkes 
point processes; see Theorem|^for the main result of the paper. To start with, we outline the history of the concepts 
involved. 

Standard time series theory for sequences of real-valued data points has been developed in seminal works like 
Whittle ( 1951|l and Box and Jenkins (19701. This theory led to the natural question of time series models for count 


data. In the count-data context, the starting point is also a defining system of difference equations of the form “X„ - 
Y^akXn-k - En + Ti/^kSn-k, « G Z”. The main idea of the construction is to manipulate these equations in such 
a way that their solutions are integer-valued. This can be achieved by giving the error terms “(£«)” a distribution 
supported on No and by substituting all multiplications with thinning-operations. For the latter, the thinning notation 
from Steutel and van Harn (|1979|l turns out to be relevant. In the above spirit, autoregressive integer-valued (INAR) 
time series were defined and examined by McKenzie (1985 i and|Al-Osh and Alzaid ( 1987| l. The modern definition of 
the INAR model comes from Li and Yuan ( 1991| l. Latour ( 1997|l gener alizes the model to the multivariate case. For 
an exhaustive collection of properties of the INAR model; see da Silva] ( |2005| l. For a general survey of integer-valued 
time series whose definitions involve a thinning operation; see Weiss p008 l. For a textbook reference; see |Fokianos| 
[and Kedem| ( |2012| . 

The Hawkes process was introduced in Hawkes ( 1971a|b| l as a model for contagious processes such as measles 
infections or hijackings. As a point process in continuous time, the Hawkes process allows for the modeling of 
intensities which depend on the past of the process itself. Its alternative name, “selfexciting point process”, stems 
from the fact that, given the occurrence of an event, intensity jumps upwards and then decays gradually. Theoretical 
cornerstones for the model are Hawkes (1974 1 which establishes the representation as a cluster process, Ogata ( 198^ 


which covers calibration issues and propagates a recursive method for likelihood calculations, Bremaud and Massoulie 


( 1996|l which extends the original model by generalizing the affine dependence on the past to Lipschitz dependence. 
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Bremaud and Massoulie (20011 which proves the existence of a specific borderline case of the model, Liniger (2009 1 
which puts the subtleties of the definition and the construction on a solid and very detailed mathematical foundation— 
especially for the marked multivariate case, and |Errais et al.| ( |2010| l which treats an important special parametrization 
of the model from a Markov process theory point of view. For a textbook reference that covers many aspects of the 
Hawkes process; see [Daley and Vere-Jones (20031. 

To the best of our knowledge, the close connection between INAR and Hawkes processes has not been studied 
before. The correspondence between the model classes becomes even more direct if one applies infinite autoregression 
instead of finite autoregression for the time series model. This was our main motivation for generalizing the existing 
INAR(p) framework with p < oa to the case p - oo. For the new INAR(cx)) model, we give an explicit construction 
and show uniqueness. Then we derive three alternative descriptions of the process, namely an autoregressive, a 
moving-average, and a branching-process representation. Furthermore, we calculate basic quantities such as the joint 
moment-generating function and the autocovariance function. These are mainly presented for comparison with their 
Hawkes process counterparts. We observe that the equivalent branching-construction of INAR and Hawkes models 
form the core of the connection. This equivalence yields corresponding equations for generating functions, similar 
moment structures, and analogous stability criteria. Theorem [^establishes a convergence result. In this theorem, for 
a given Hawkes process N, we construct a specific family of INAR(oo) sequences 
of this family, we derive a point process by setting 


«gZJ A>0 


From each member 


^ a < b. 


n: An€(a,h] 


The theorem states that converges weakly to the Hawkes process A when A goes to zero. This result is relevant for 
applications of INAR and Hawkes processes. In particular, the convergence theorem yields an estimation method for 
the Hawkes process by estimating the more tractable approximating INAR model instead. We work out this estimation 
method in Kirchner ( 201 5bj l . Moreover, from a purely theoretical point of view, the presented line of thought is useful: 
the time series perspective on point processes as well as the point process perspective on (integer-valued) time series 
can be fertile for constructing and understanding event-data models; see Sectionj^ 

The paper is organized as follows: Section 1 introduces the INAR(oo) model. Section 2 presents the Hawkes 
process in the random counting-measure framework. Section 3 establishes the convergence theorem. Furthermore, it 
collects structural analogies between the two model classes. In the final section, we conclude with a discussion on the 
broader interpretation of the INAR-Hawkes relation. 


1. The INAR(oo) model 

Throughout the paper, we consider a basic complete probability space (Q, T, P) carrying all random variables in¬ 
volved. 

1.1. Definition and existence 

Definition 1. For an No-valued random variable Y and a constant a > 0, the thinning operator o is defined by 

/I=l 

where ~ Pois)®), n e N, independently of Y. We refer to as counting sequence. 

In this definition and throughout the paper, we use the convention that On 0 for any sequence (a„)„eN c K. 

Definition 2. For > 0, k e No, let e„ Pois(ao), n e Z, and ~ Pois (afi), independently over n e Z, k e N, 
/ € N, and also independent of (e„). An integer-valued autoregressive time series of infinite order (INAR(oo)) is a 
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sequence of random variables (Xn)nez which is a solution to the system of stochastic difference equations 


~ Xfi ^ ^ 

k=l 

oo X,j-k 

k =\ /=1 


( 1 ) 

( 2 ) 


We call ao immigration parameter, (s„) immigration sequence, > 0, ^ e N, reproduction coefficients, and K 
Ok reproduction tnean. 


In most situations, it is enough to use the thinning notation from Q in Definition [^without explicitely writing out the 
counting sequences as in (|^ —^keeping in mind that each “o” operates independently over k e N and n e Z. Clearly, 
Definitions [T] and [^depend on the choice of the distribution of the counting sequences. A more obvious option would 
have been sequences of Bernoulli variables. This has in fact been the choice in the cited INAR(p) literature. The 
Poisson choice for the counting sequences, however, again yields a Poisson distribution for “X„\X„-i,X„^ 2 , ■ • ■ This 
in turn leads to formulas that are simpler and that can be compared with their Hawkes counterparts more directly. 
We will address this issue in Sections 3.4 and|^ For the following existence and uniqueness result, any No-valued 
distribution with finite first moments would do for the counting sequences. Throughout our paper, “stationary” is 
understood as “strictly stationary”. 


Theorem 1. Let ak > 0, k e No, with reproduction mean K ak < 1- Then has an almost surely unique 

stationary solution where € No, n e Z, EX„ s q'o/(1 ~ K), n e N. 


Before giving the proof, we highlight the branching nature of the solution to 0. As we will see, one can interpret 
this solution as a model for the size of a population, where each individual is alive exactly during one time-step. 
Furthermore, each individual is either an immigrant or stems from a prior individual. This is similar to a Galton- 
Watson framework with immigration; see Section 5 in Seneta (1969 1 . In contrast to the Galton-Watson setup, however, 
each INAR(cx)) individual does not only have offspring at the next time-step but (potentially) at any future time. 
The proof first formalizes this structure and then establishes that the construction indeed yields a process with the 
desired properties. We emphasize the branching intuition of family processes consisting of generation processes by 
the somewhat unusual but suggestive notation for stochastic processes {Fn) and (G„). 


Proof Let s,- ~ Pois (ao), i e Z, be the immigration terms from Definition|^ For the y'-th potential immigrant at time 
i 6 Z, we define generation processes g e No, by the following recursive procedure; 


Gfl ’ I — l{n-o}, n € Z, / G Z, y € N 




n 


'^akoG 


n-k 


k=l 


(3) 

(4) 


For all distributional properties of the construction, it will be enough to apply defining equation Q for the generations. 
We will consider an explicit representation of the involved thinning sequences later. Note that - 0 whenever 

n < 0. A family originating from the y'-th individual immigrant at time i is the superposition of all the corresponding 
generation-processes: 


;= Yj n e Z, i e Z, ;• 6 N . (5) 

g=Q 

The candidate series ^X„^ for a solution of (|^ is the superposition of all these families—modulo an appropriate shift 
in the time index: 




(ij) 


n € Z. 


j=i 


(6) 
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Note that only family and generation processes indexed by (/, j) with j e {1,..., s,) come into play. Also note that all 
infinite series involved in the construction above are well-defined because their partial sums are nondecreasing. As a 
first step, we remind ourselves that K - and establish 

OO 

E 2 ge No, i eZ,je N, (7) 

n=0 

by induction: for g = 0, (|7]i is correct because we have E 2“„o 1{«=0) = 1 = For g > 0, one can show 

that E 2^0 G^n'''^^ = ■ E 'Zi7=o Gn® and (|7]i follows. By Q, 2^=0 expectation 1/(1 - K). In particular, 

this expectation is almost surely bounded. We conclude that € No, n e Z, almost surely. For X„, n e Z, we find 
that 


EX„ 

n Sj n n OO oo n oo 

i=~cc j=l /=-oo /=-oo g=0 g=0 i=-oo g=0 


ap 

1-K' 


Consequently, X„ is almost surely bounded and therefore almost surely No-valued. Note that, by construction, the 
generations and therefore the families are independently and identically distributed time series over 

i 6 Z and j € N. Stationarity of (x„^ then follows from the i.i.d. property of the immigration sequence (e,). 

To show that our candidate sequence {X„) indeed solves Q, we have to work with an explicit representation for 
the thinnings involved in the Q. To that aim, let 

~ PoisCaj:), independently over i,n e Z and k, /, g, m e N. (8) 


In our branching terminology, denotes the number of offspring individuals at time n whose parent lived at time 
n - k. This parent belongs to the (g - l)-th generation of family (/,/). Furthermore, this parent is the m-th such 
individual. We repeat the defining recursion for the generation processes from above—this time we represent the 
involved counting sequences explicitly: 



(9) 

k=l 

-Z ZCI^’ «6Z,f6Z, ;eN,geN. 

(10) 

(11) 


lc=l m=l 


It is obvious that ([TT|i justifies the distributional assumptions on ( [TOl i (i.e., Q), used in the first part of this proof For 
any n e Z, we find 


n-l 


n-l 


^■=EEfi‘-’=EE +E G" ‘E EE 


( 12 ) 


i=—oo j-\ i=—oo j=\ j=l i=-oo j=l g=0 

Note that the third summation really starts in ^ = 1 because = l(«-/:=o} = 0 whenever i < n - 1. For the triple 

sum in 0, we obtain 


n-\ £-, oo 


n-l £i oo n-l ^n-i-k 

EEE^:'‘EEEE E 4“;“ 

i=-cc j=l g=l /=-oo j=l g=l ^=1 m=l 

Ms-Ui.j) \ 


E 

k=l 


n-k Ei 


EEEE 

i=-oo j=l g=l m=l 


■(n,k) 


(13) 
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For([T 3 , we use the fact that it is irrelevant whether we let k run over {1,2,..., n - /) or over N because - 0 

whenever k > n - r,hy the same argument, we may let i run up to n - A; only. For fixed n e Z and k eN, the term in 
the bracket is a sum of i.i.d. Pois(aj;) random variables (stochastic) index set 

j,m)eZ'^ : 1 < g, i < n- k, 1 < ; < e,-, I <m< 

For the size of we obtain 


n-k £i oo 


n-k Si 


= Z Z Z - Z Z 

i=-oo j=l ^=1 /=—oo j=\ 

Let be the counting sequences from Definition]^ Note that, for n 6 Z and k € N, 

(f : / = 1 , 2 ,..., ) and : (g, i, j, m) e 


(14) 


(15) 


are equally distributed—no matter which order we choose for the second set. Also note that, for (n, k) + («', k'), we 
have that n - 0 Consequently, the independence properties over n and k necessary for the thinnings are 

preserved. We did indeed make correspondence ( [T5] l explicit. The reordering of the counting sequences, however, is 
cumbersome. It involves a function that is recursively defined on multiple levels; its presentation would double the 
length of the whole proof—yielding hardly additional insight at that. So we chose to leave it with ( [T5] l. Continuing 
with ([T 2 |i, we obtain that, for n e Z, 


n-\ Si OO 


|/(...C)| 


/=—oo j=i g=Q k=i k=l l-l k=l 1=1 

We conclude that (x„^ indeed solves 0 . 

For uniqueness, consider two stationary solutions X, Y of Q —naturally with respect to the same counting se¬ 
quences , n e Z, k e N. Then 




^ (ak o Xn-k -akO Y„_k) < E ^ |ai O X„^k -ak° T„-t| ^ E \ak o Xn^k -ak° (16) 


A=1 A=1 A=1 

As X„-k and Yn^k are thinned with respect to the same counting sequence, we have that 

F„_, 


Z^w)_z^w) 


/=1 


1=1 


^ ak°\x„^k-yn-k\, keN. 


Y^k ^n-k ^k — 

Plugging ( [T7) t in ( [T^ , we obtain 

oo oo 

E |X„ - T„| < ^ E [a, o \x„^k - T„-a|] Z ® "" K^\^n-Yn\, neZ. 


(17) 


A =1 


A =1 


As < 1 by assumption and E |2f„ - T„| < co, we get that E |2f„ - T„| = 0 and therefore X„ - Yn,n e Z, almost surely. 

□ 


7.2. Alternative representations 

Surprisingly, we can explicitly represent the INAR(oo) model as a standard AR(oo) model with uncorrelated errors: 
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Proposition 1, Let > 0, /: € No, with K — ak < let (X„) be the corresponding INAR(oo) process; see 

Definition^ Then 




k=\ 


defines a stationary sequence (t/„) with E t/„ = 0, n € N, and 


E \_UflUiqf^ 


0 ^ 


Q'Q 

l~K^ 


n ^ n', 
n = n'. 


Furthermore, we have that 


i^n -fix)-Tj (Xn-k - f^x) = ««, « ^ Z, 


(18) 


(19) 


( 20 ) 


k=l 


where fix ■— EXq = aa/il — K). In other words, (m„) is a (dependent) white-noise sequence and the time series 
(X„ — px)nez described in terms of a solution to an ordinary AR(oo) system of difference equations. 

Proof The sequence values m„, n 6 Z, are well-defined because the partial sums of OkXn^k are nondecreasing 
and their expectations have a finite limit. Stationarity of («„) follows from stationarity of {X„). For the expectation, 
we find 


Eu„ = E 


Xfi CTo 'y O^fi-kXfi 


k=l 


MC 


= px-ao- Kpx = 0 . 


For the autocovariances of the errors, first of all note that the errors m„ are uncorrelated with any previous value 
X,,', n' < n, of the INAR(oo) sequence. So that, for n' < n (and then, by symmetry, for n' + n), 


E [u„u„>] = E 




n' tXQ ^ tXkXii'^k — E \UfiXff') E [M/iUo] ^ Clk E [u,iXfi'-k) — E [m^] QTq — 0. 


k=l 


Since E = 0 and E = 0, for A: 6 N, we obtain 

E[«^] 

/ oo 

n ~ ^0 ~ ^ ^ ^k^n- 


= E 

= E [u„Xn] 

= E 


k=\ 


= E 


= E 


= E 


j ^k ^ Xfi-k 
k=l 

OO 

~ ^ j ^k^n~k ^ ] ^k 

k=l A k=l 

oo oo \ / 

^ ] ^k ^n-k ~ Q'Q “ ^ ^ ^k^n-k ^ ^ ^n-k 

k=l k=l ) V k^\ 

oo oo / oo 

^ti ^ ] ^k ^n-k ^ ] ^k ^n-k ^ ^ 

k^l \k=l 

oo oo / oo 

^ ^ ^k ^n-k ~ ^ ] ^k^n-k ~ ^ ] ^k^n-k ^ ^ ^k ^ 


k=l 


/ 

\ / oo 


Q^O^n ^0 


k=l 


k=l 


\k=l 


/ \k=l 


Using independence of from the past of the process and using E = Var(£:„) = cro, we get that 


E ^uf^ = ao + E 


^ ^ ^k ^n-k ^ ^ ^k^n-k ^ ^ ^k ^ ^n- 


\k^\ 


/ k^\ 
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\k=\ 

















































For the expectation part in the latter equation, we condition the difference in the expectation on cr(X„_j:, k e N), the 
past of the process. Then the thinnings become the only source of randomness. Note that the counting sequences of the 
involved thinnings only feed into and that they are independent of the past values ; see Definition|^ 

From this independence, we obtain 


E 


^ ^ ^ ] ^k^n-k ^k ^ 


\k=l 


/ k=l 


\k=l 


= E 


o-(X„^k, keN) 


oo \2 oo / oo \ 

^ ^ ^ ^n-k ~ ^ ^ ^k^n-k ^ ^ ^k ^n~k 


\k=\ 


/ k=\ 


\k=\ 


Xn-k=Xn-k,ke^ 


for the conditional expectations. So in a first step, we take the unconditional expectation with respect to the thin¬ 
nings only and treat the X„-k variables as deterministic. Observe that, for deterministic {xn^k)km ^ No such that 
2^1 OkXn-k < we have that 


/ OO ' 

2 oo 

' oo '' 


' oo \2 

^ j ^k ^n-k 

Alt-1 ) 

“ ^ j ^k^n-k 

k=\ 

U-1 / 

= E 

^ ^ ^k ° ^n-k 

Vit-1 / 



2 


(Xk ^ Xfi^k 


Var 


Z 

\k=\ 


\ 

^k ° ^n-k 

j 


One can show by standard arguments that the variance of these series is the limit of the variances of the partial sums, 
hence 


Var 


z 

VA :=1 


\ 

^k Xfi-k 

j 


y] Var(Q;<. o Xn-k) = ^ akXn~k (< o°)- 
k=\ k=\ 


( 21 ) 


For the last equality, we use that, for x 6 No and a > 0, a o x is the sum of x independent random variables with 
distribution Pois(Q'); see Definitionj^ As a^X^^k is almost surely finite, we may average ( |2T] i over the possible 
values of Xn-k and conclude 





' OO 



E ^u 

- ao -1- E 

Var 

/ J ^k 





U=1 


(x„)=(X„). 


_ V' MC ao 

= ao H- E2_^ QkXn^k = ao + Kfix = ^ _ ■ 


jt=i 


This establishes ([T9]l. Equation (|20| is a simple algebraic transformation of (fTS)). 


□ 


The third description of the INAR(oo) model is the representation as a standard MA(oo) time series. It will be most 
helpful for establishing the second-order properties of the process. 

Proposition 2. Let ak > 0, k e No, with at < 1. Then the corresponding INAR(oo) process from Definition^is 
a solution to the family of equations 

OO 

Xn- px^'^PkUn-k, neZ, ( 22 ) 

k=0 

where (uf) is the white-noise sequence from Propoiif/onj^ px E2fo = aol(l-K),f3o 1, andfk Zti o^fik-i, k e 

No- Furthermore, jSt > 0, k e No, and Yj‘k=oPk — 11(1 — K) < co. 


Before giving the proof, we note that the moving-average coefficients (/St)teNo defined above correspond to (E 
the expected values of a single family-process after k - 0,1,2 ,... time steps; see •Si- Also note that Proposition 
above is not a mere corollary of standard results like Theorem 3.1.1 of Brockwell and Davis ( 1991| l, stating that 
ARMA(p, q) processes (p, q' < oo) are MA(oo) processes. The argumentation in our case, i.e., p - oa, has to be more 
subtle: we have to prove that the (in general infinitely many) zeros of the involved power series can be bounded away 
from the unit circle. 
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Proof. Let B be the backward shift operator defined by Xn-k, k e Z, for any sequence {x„)nez- Consider the 

power series f fe) 1 - Zti With these notations, we may rewrite ( |20| as (p{E) (X„ - jUx) - Un, n e Z, where 
(m„) is the white-noise sequence from Proposition[^ Note that 0 < \4>{z)\ < 2 for |z| < 1. Indeed, for |z| < 1, 


and 


\f{z)\ = 

oo 

1 - ^ CtkZ^ 

> 1 - 

OO 

Y^^kz'^ 


k=\ 


k^\ 

\fiz)\ = 

oo 

1 - ^ O'kZ^ 
k=\ 

< 1 H- 

oo 

Y^^kz'^ 

^=1 


> 1 


< 1 


- ^ ak\zt >l-K>0. 

k=\ 


+ Yj <1+K<2. 

k=l 


As (f>(z) + 0 for |z| < 1, we may define the function \f(j) := Ijcpiz), |z| < 1. The original function (p is analytic 
on |z| < 1, so i/f is also analytic on the open unit-disc and, consequently, we have a power series representation 
fiz) = Tj^k^oPkzl^^ 1^1 < L As 1 = ip(z)4>(z) by definition, if follows that, for |z| < 1, 




k=Q 


l=\ 


k=0 V 


7=1 


Comparing coefficients in ( |2^ , one obtains the recursion 

k 

/?() = 1 ?Lnd Pk^Y ken. 

1=1 


(23) 


We note that > 0 because ak > 0. Formally, we can write 


Xn - Px ^ neZ. 


(24) 



absolutely convergent. By the same calculation as in ( |2^ , we find that the Fourier-coefficients of 1/^(0) are exactly 
the Pk, k e No, from our xjj function. From this we get 


\ ^ I „ I Lemma 

2^Pk\ = 

k=0 


1 

W) 


1 

m 


1 


l-K 


< oo. 


We conclude that ( [24l l is a meaningful family of equations. In other words, {X„ - px) can be represented as a moving- 
average process with respect to the white-noise sequence (m„). □ 


From the explicit construction in the proof of Theorem [T] we find the following branching representation of the 
INAR(cx)) process. The branching formulation will be useful for the derivation of the moment-generating function. 
It furthermore summarizes the most elegant and, at the same time, efficient way for simulating from the INAR(oo) 
model: 


Proposition 3. Let (X„) be an INAR(oo) sequence with respect to an immigration sequence (e,) and reproduction 
coefficients ak > 0, k e N, so that Uk < 1; see Definition^ Then 




6 Z, 


/eZ _;=1 


(25) 


where are independent (over i e Z and y e N) copies of a branching process (F„) defined by 


neZ. 

s=o 


(26) 
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The generations (G„) in ( |26| t are constructed recursively by 


« ^,,-k 


— 1|«=0) and ^ o Gjf_/^ ^ 


(27) 


/^=1 


^=1 m=l 


w/f/i ~ Pois{aic) independently over m,n,k,g. Furthermore, we have the following distributional equality for 

the generic family-process (F„): 

n 


(Fnlnel, - 


1=1 >1 


(28) 


Proof Equation ( [25] ) together with ( |26l l and ( |Z7l i is exactly the construction of a solution to the dehning difference- 
equations (|^ in the proof of Theorem [TJ see Q and ( [TT] ). To establish ( |28l l, consider the process on the right-hand 
side: 

n 


(f"Lz 




1=1 j=\ 


We show that the process (f„^ is constructed by the same (stochastic) recursion as (F„y, see ( |26l ) and Then the 
equality in distribution follows. For n eZ, we dehne 


G 


Gf:=l|„=o) and ^ Z ^ ^ 


(29) 


1=1 >1 


where , g e No, are the generation processes that constitute the family processes , i e Z, j e N, in 

( |28| ). In particular, are independent copies of the generations dehned in ( |27] i. Then, by construction, 

F„ - n e Z. This establishes a representation for (^F„^ of the same form as ( |26] l for iF„). Next, we show 

that the summands follow the same recursion ( |27l l as the original generations (G^f^): for g = 0, we have that 
G^n^ = ljn=o), n e Z. So the starting value of the recursion for is the same as the starting value of the recursion 
for For g = 1, recursion is also analogue: 


“ Z Z “ Z Z -TjOko i|„_^=o) -TjUko G^ 


:( 0 ) 


-k- 


i=l ;=1 

And, for any g > 2, we find that 

n 


i=l j=l 


k=l 


k=l 


d) 


r-(» 

n n~i 


t 

n n-k 


:(1) 


= 2 Z =Z Z Z ■'* “ =Z Z Z “ <PiP- 

i=l j=\ i=l 7=1 A:=l k=l i=l j=l 

where in the third equality we use that = 0, g > 2, n - i - k < 0. At this point we avoid the explicit 

representation of the counting sequences. We just remind ourselves that all thinnings involved are independent and 
establish 

n n-k G^” „ 

g'®^ = Z ° Z Z = Z ° neZ,g>2. 

k=l i=l 7=1 ^:-l 

In other words, the processes (G„) and (g„) and, consequently, the processes (F„) and (^F„^ are constructed by the 
same stochastic recursion. We conclude that (F„) = (f„). This establishes ( |28l ). □ 
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1.3. Moment structure 

From the representation of the INAR(oo) sequence as a superposition of shifted i.i.d. family processes in Proposition]^ 
above, we derive equations for the joint moment-generating function of the model. First, we fix some notation; 

Definitions. For any sequence (f„)neNo c K, let supp((f„)) min{n 6 No : 4 = 0, A: > n) be the support of the 
sequence (f„). We use the convention that min0 := oo. Furthermore, for A c K, let coo(A) {(fn)neNo c A : 
supp((f„)) < oo}, the space of sequences in A with a finite number of nonzero values. For any time series (Fn)neNo. we 
define the joint moment-generating function 


M(Y„)i(ln)neNo) := Eexp 



(t„) e Coo (K) ■ 


(30) 


The somewhat unusual definitions above have been chosen for most direct comparison between the INAR(oo) joint 
moment-generating function and the Laplace functional of a Hawkes process; see Propositionand Proposition]^ 
below. 


Theorem 2. Let {Xf) be an INAR( oo ) sequence with respect to immigration parameter ao > 0 and reproduction 
coefficients a/t > 0, k e N, such that K = < 1/ ■see Definition^ Then there exists a constant 6 > 0 such that 


M, 


(X, 


)((t„)) < expjiiao^^l < (L) e coo((-oo,d]). 


where d supp((f„)) + 1 is the maximal number of nonzero values of{t„). Furthermore, we have that 

M(X„){{tn)) - exp I Q'O ^ ’ 


(31) 


(32) 


where we set := 0/or m < 0. Here, (F„) denotes the generic family-process from Proposition^ Its joint moment¬ 
generating function M(^fj is the unique solution to 


(33) 


A^(F,.)((s«)) = e'» exp ak[M^F„)i(Sn+k)neN) “ ^ eoo((-oo, 5]). 


Proof. For ( [32l i, we first apply representation 

M(x„) ((D) = Eexp|^f„X„| = Eexp|^f„X„| = Eexp|^ ^ ^ i =E ]~[ ]^ exp1 

I ln=0 ieZ j=l I ieZ i=\ U=0 j 


l.«=0 


«=() 


In the following, we set f„, 0 whenever m < 0. Conditioning on the immigration sequence (e,) and exploiting its 

independence from the family processes yields 


M(x„)(ti ,..., frf) = E j~[ j~[ E exp i ^ tnF^f 
ieZ j=l l«=0 


■f)l 


j~[E E expl^fnFn^, 


I «gZ 


- j~~[ E [M(/rj((fi+„)„eNor‘] 

ieZ 

= n {»o(M(f„)((f,'+„)„eN„) - l)} 


= exp j ^ a'o(M(f j((f,-+„)„eNo) “ l) [ ’ 
I i'eZ j 


(34) 
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where in the last but one step we applied the formula for the probability-generating function of a Poisson random 
variable. Up to now, is only a formal representation of M^x„) in terms of It is not clear yet, whether and 

when M(x„) is finite. For ( [3^ , we apply the equation ( [27] i of (F,,) from Proposition]^ 


iW(F„)((-s«)) = E exp 1 ^ s„F„ 1 ^ E exp • 

OO 

Z"” 

' n ^ 

ii-o)+zz^«'i 

U=o j 

n =0 

1 j=i J 


We note that in the last term, the index k may run to oo instead of n, because - 0, j e N, whenever k > n. After 
straightforward calculations, we obtain 


iW(F„)((sn)«eN„) = E exp 


Wl) 

oo oo 

/2=0 k=\ j=l 


- exp 


| y] ak{M(F„){{Sn+k)neno) “ l)| 


(35) 


Next we derive finiteness of M^f,,)- Let (i„) be a sequence with finite support and s max{i„) so that ts 

bounded from above by sS, where S Z“=o denotes the total number of individuals in the generic family (F„). 
We remind ourselves of the defining equation (|26|) for the family process {F„) and find that 


OO OO OO OO OO 

* = Zf" = ZZ°“ = ZZc?’ 

«=0 «=0 ^=0 g=0 n =0 


(36) 


We denote the total number of individuals in the g-th generation by Yg 2^=0 S ^ No- The sequence 
is the embedded generation process. Applying (|27ll, we find that Fo = 1 and, for g >2, 


= Z Z ° = Z Z ° =Z ° Z = Z ° = Z 

«=0 n=0 A:=l ^=1 «=0 ^=1 n=0 k=l k=l 


■is) 

k ’ 


where Pois(/f), A:, g 6 N. In other words, the embedded generation process ^Fg) is a standard Gabon-Watson 


branching process. From ( [36l l, we see that S - Z”=o ^g- other words, S is distributed like the cumulative limit 
of a standard Galton-Watson process. The moment-generating functions of such limits have been considered in the 
literature: as K < 1, by Theorem 2.1 in |Nakayama et ah ( 2004| l, there exists a 5 > 0 such that Eexp{55) < oo 
if and only if there exists a § > 0 such that Eexp{(5^j^''} < oo. The latter is indeed the case because the moment¬ 
generating function of a Poisson variable is finite on M. Furthermore, by Jensen’s inequality, we see that 1 < 
lim„^oo Eexp{d5/n) < lim„^oo (Eexp{d5))'^" = 1. So, for any given e > 0, we can assume the existence of a 
6 > 0 such that E exp {6S ) < 1 H- e. In particular. 


3d> 0 such that M(/r,_)((i„)) < Eexp(55) < 1 -i-(l - K)I(2K) for (i„)„(=N„ e cqo ((-oo,5]), (37) 


where, as before, K - cr^. < 1. We now have established finiteness of M(^f„) in a neighborhood of zero. It remains 
to establish finiteness of M(x„)- Our goal is to bound the series representation ( [34l i of M(x„). To that aim, we need 
to refine the bound P7| ) for M(/?„)((s„)). To that aim, with the constant 5 > 0 from for i 6 Z, we introduce the 
sequences ^5®) defined by 



n — i, i — I, ..., i — d, 
else. 


(38) 


where d = supp((f„)) -i- 1, as before, with (t„) the considered argument sequence. Note that, for i < 0, we have that 
dj,'' = 0, n e No. Consequently, by definition of M(f„), for i < 0, we have that M(f„) Furthermore, 

observe that (d®j.) = , i,k e Z. For (f„) e coo((-°°,5]), we have by component-wise monotonicity of M(x„) 
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that 


M(x„m) < M(x„) ((4t)-N„) ^ expjg ao(M(f„)((4t)neNo) - l)|=^^p| Z t^o(M(f„)((5r'>)„eN„) - l)j 


exp |ao ^ '«/|, 


where we set m; := )„eNo) - 1, ! e Z. Note that, by ( [TT] ), we have that 

0 <mi<{l - K)li2K), ieZ. 


( 39 ) 


(40) 


and, in particular, m, = 0 for i < 0. In the following, we only consider m, with i > d. In this case, we get from ( [38] l 
that e^o - - I and we obtain the recursion 

ntj = M(/r,_)((5®)„<=No) - 1 ® e'*" exp Q;/t(M(f„)((d®^)„eN„) - l)| - 1 ^ exp aimj-ij - 1, i > d. (41) 
For the sum in the exponential, we find that 

a,(l - K)/(2K) ^K(l- K)/(2K) <1, ieZ. 


k=l 


k=\ 


Therefore, we may apply the exponential inequality exp(x) <(l-x) ',x<l,in (|4T]i: 


m: < 


_ Z'k=i akkrii-k 1^ Z‘k=i Okm-k 


I-Y!k=\°^kmi-k 1 - Iii=i 1 - a,t(l -/r)/(2/:) 

Summing both sides of (|42ll over i > d, we obtain 


2 Etli akkkli-k 

Ajk-I . ^ ^ ^ 42 ) 

\+K 


OO ^ OO OO ^ OO OO ^ oo oo a OO 

Z Z Z t‘4y| Z ZA 


iW+1 


i=d+\ A:=l 


A:=l i=d+\ 


k=\ /=0 


' d oo \ 

nil 

V/=0 l=d+\ ) 


Keeping in mind that 1 - 2K/(1 + K) > 0, we solve ( |43| ) for YT=d+\ kki-i find that 


( 2K \ ‘ 2K ® 

‘-iTx ITxA"'^ 

i=d+l ^ ' /=0 




(43) 


(44) 


For the summation of (/«,),gNo over i e No, we finally obtain 

“ 0 ^ m 

> nii s y nii + d + \ S (d + 1) 

i=0 i=0 


1-K 

2K 


+ 1 =(£/+!)- 


1 +K 
2K ■ 


(45) 


We conclude that, for all (f„) e coo((-°°,^]) with supp((f„)) = d, we have that 


M(x„)(f 


i,...,fj) 9 expjao Z m\ 9 exp jcro^/- 

^ i=o ^ 


+ K 


2K 


< OO. 


(46) 


Uniqueness of follows by induction over the (finite) support of the argument sequence. In that sense, the 
implicit equation ([T5|) specifies M(/r_) recursively. □ 
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As a matter of fact, every moment-generating function that is finite in a neighborhood of zero has a Taylor series about 
zero. The coefficients of this series are the joint moments. Consequently, from Theorem]^ we obtain 

Corollary 1. Let (X„) be an INAR(oo) sequence as in Theorem[2 For m 6 N and ki,.. .,ki„ e No, we have that 


E 




■Xt 


< oo. 


The second moments of the INAR(oo) sequence are particularly tractable: 

Proposition 4. Let (X„) be an INAR(oo) process with reproduction coefficients Ok > 0, k € N, such that K 
2^1 Ok < 1> cind immigration parameter oq > 0. Furthermore, let R( j) Coy (Xn,Xn+j^, j e Z, be the autocovari¬ 

ance function of the (stationary) sequence. Then 


OO 

Rii) = ^ ye 


(47) 


where /?o 1 pk Tji=i addition, we have that 




Oo 


J=o 


(1 - 


< oo. 


(48) 


Proof With the notation from the moving-average representation of the INAR(oo) sequence in Proposition]^ we find 
that, for j, n 6 Z, 


Rii) = COV (X„ -px,X„+j- px) = COV ^ fkUn-k, ^ fk 


ttnf-j-k 


\k={) 


k=0 


From Proposition I ij we know that Coy(un,u„+j) = lp=o)«o/(l - K), j 6 Z,- Furthermore, from Propositional we 
have that the coefficients j3k are absolutely summable. So (|47ll follows from Proposition 3.1.2. in|Brockwell and Davis 




( 1991|l. For the sum of the autocovariance sequence, we observe 


OO OO 


i Proposition; 


OO OO 


Z ««= ifk Z Z'**- = ifk Z'*' Z'’" tZx Z'*' Z^*' 

k=0 k=0 i=0 i=0 k=0 . 

The last equality re-uses the result A' - 1/(1 - K) from Proposition]^ 


oq 


'+k 


i=0 k=-i 


(l-K) 


3 • 


□ 


1.4. INAR(p) embedding 

As one would expect, the INAR(cx)) model can be interpreted as the limit of the usual INAR(p), p < oo, model: 
if we truncate the excitement sequence of an INAR(oo) time series after a large lag p, the resulting process is an 
approximation of the original time series: 


Proposition 5. Let (X„) be an INAR(oo) sequence with resect to immigration parameter ccq S 0 and reproduction 
coefficients ak > 0, k e N, K = Ok < 1; see Definition^] Furthermore, for p 6 N, let be another INAR(oa) 

sequence with respect to the same immigration parameter ao as above and reproduction coefficients specified 

by 


(p)._\ok, l<k<p 
-“lo, k>p. 


Then the finite-dimensional distributions of{xh’^'^ converge to the finite-dimensional distributions ofiX,,) as p goes to 
infinity. 
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Proof. It suffices to show that the corresponding moment-generating functions converge. We apply the notation from 
Theor em [2j respectively, from Definition|^ Let be any sequence in coo((-°°,^]), where <5 > 0 is the constant 

from (|39|^ We will establish that for any limp^oo .. .Jd) — , td). As a first step, we show that 

the moment-generating functions of the corresponding family-processes converge, i.e., that limp_,oo = 

M(F„){{Sn)n£¥io) for Sequences (i„)«eNo ^ coo((“°°>^])- We argue with induction over the size of the support of the 
sequence; for a sequence (““> 5) with supp(i®) = 0, we find that 

Im ((i®)„eNo) ® ((i™)„eNo) ■ 


Now, for d eN, let c (-oo, (J] be a sequence such that supp d. We obtain 

i™ ^iFf) ® exp I hrn ^ - l)| • 


Clearly, lim^ 


Jp) 


- Qk- Furthermore, we see that, for k eN, - 0 whenever n> d-k. This means, the support 


of the sequences {s^^l^)n is strictly smaller than d. Consequently, by induction hypothesis, we have that, for k e N, 
limp^oo ((sll\)«eN„) = iW(F„) ((4+,t)«eNo)- We may conclude: 

jim ((sif^)„eNo) = exp “ l)| ® M^f„) ((s$f’)«eN), deN. 

This establishes weak convergence of generic family processes. In a second step, we apply formula < [32| ) for Mx(p)'- 

“ exp I ^ ^ ((^(+n)«eNo) ~ ^ ^])- (^9) 


. ieZ 


We show that we may apply the dominated convergence theorem on the right-hand side of ( |49l >: 
be sequences defined by '.-6, n e {i - d,... ,i - 1, i), and := 0, else; see ( [38) . We find, for i e Z, that 

The second inequality follows by similar induction over the support size as before. We already know from ( |45) that 
(^M(F„)i(d^n~‘^)) - l). is summable over i. In view of this dominating summable sequence, we take the limit on both 
sides of and conclude that, for (f„) e coo((-°°, 5]), 

Im ((f„)) = exp | Im ^ i(tn+k)neNo) - 1) [ 

p^OC K K j 


DC 


exp z ao{M(F„) i(tn+k)neNo) “ 1 ) / 


(keZ 

= Mxiitn))- 


□ 


2. The Hawkes process 


After the long first section on the new INAR(oo) model, the following shorter section formally presents the well-known 
Hawkes point process. We treat point processes as random counting-measures and only consider point processes on K. 
First, we fix some general notation and terminology. Then we recall the definition, the existence theorem and selected 
properties of the (univariate) Hawkes process. For the general theory, we mainly follow Resnick (1987), Chapter 3. 
For the Hawkes part, our main references are the seminal papers [Hawkes j ^91 1 a|b) and [He 


awkes 


97^ 
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2.1. Preliminaries 


Let S S(K) be the Borel-sets in M and IBb {B e S(IR) ; B bounded). A measure m on K is a point measure 
if m(B) € No, B e Sb. We denote the space of point measures on K by Mp Mp(W). Let CJ := CJ(M) be the 
space of nonnegative continuous functions on K with compact support. Point measures (m„) converge vaguely to a 


point measure m if lim„_,oo f f(t)m„(dt) f f(t)m{dt), f e CJ(K); we write ot„ ^ m. Vague convergence yields 

the vague topology on Mp. The Borel cr-algebra generated by this topology, Mp !B(Mp), coincides with the cr- 
algebra generated by the sets {m 6 Mp : m{A) = A:), A e !Bb,k € Nq; see Lemma 1.4. in Kallenberg ( 1983|l. Any 

V 

measurable mapping : {Mp, Mp) (M, S) such that lim„^oo = (t>(m) whenever m„ —» m, is continuous 

with respect to the vague topology. Our basic underlying probability space is (O, 'F, P). A measurable mapping 
N : (Q, F) ^ (^Mp,Mp ^, at N{a>) is called point process. The history of a point process N is the filtration 
where, for f € K, 


:= 6 O : N{(jS){(a, b]) e b| ; -oo < a < Z? < f, B c No j. (50) 

We assume that c F, f e K. Note that our definition of a point process allows multiple points, i.e., we may have 
that “P [V({f)) > l|A({f)) > 0] > 0”. Also note that, for f 6 K, the cr-algebra includes all sets of the form 


{w : A(w)({f„)) = Fi, L)) = 0 , n = 0 ,- 1 ,- 2 ,... || with f fo > f-i > ... and ko,k-i,... e N. 

(51) 


2.2. Definition and existence 

Definition 4. For any point measure m 6 Mp, define the Hawkes intensity 

A(t\m) rj + I h{t — s)m{ds), f e K, 

Jr 

where ?; > 0 is a constant and /i : K ^ Kq is a nonnegative measurable function with h{t) = 0, f < 0. We refer to 77 as 
immigration intensity and to h as reproduction intensity. 


The immigration intensity is often called baseline intensity and the reproduction intensity is often called excitement 
function. However, our objective is to highlight the similarity between Hawkes and INAR processes. Consequently, 
we make use of a joint branching-process terminology; see Definition]^ 

Definition 5. Let d be a Hawkes intensity as in Definition]^ A Hawkes process is a point process N that is a solution 
to the family of equations 


E 


lAN{{a, b]) 


= E 


D 

! 


Ia Ais\N)ds 


a<b,Ae 'Tjf. 


(52) 


A priori, it is not clear whether this family of equations has a solution, whether any possible solution would be unique 
(in a distributional sense) and whether the distribution of a solution would be stationary. These questions are answered 
by the following proposition; see Hawkes ( 1974|l: 


Proposition 6. Let Abe a Hawkes intensity with immigration intensity Tj > 0 and reproduction intensity h such that 
h(t)dt < 1. Then there is precisely one stationary process that satisfies ( |52| l. 

The existence and uniqueness result above is established by the observation that the solution to ( [52] i must be a cluster 
process or—more specifically—a branching process with immigration: the points are interpreted as individuals that 
are either immigrants or offspring. The immigrants (or cluster centers) stem from a homogeneous Poisson process 
with intensity rj. These immigrants form generation zero of the following branching procedure: an immigrant at time 
i 6 M triggers an inhomogeneous Poisson process with intensity h (■ - s) where h is the reproduction intensity of the 
process as in Definition]^ These offspring individuals form generation one. Each of these first-generation individuals 
again triggers an inhomogeneous Poisson process in a similar way, etc., so that the families (or clusters) are generated 
by cascades of inhomogeneous Poisson processes. 
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2.3. The Laplace functional 

The cluster and branching process point of view is also fertile beyond the results of Proposition For example, it 
leads to equations for the Laplace functional of a Hawkes process. The Laplace functional '¥n of a point process N is 
a functional defined on the space of nonnegative measurable functions with compact support by 


'P^[/] -Eexp 


r 


f(t)N(dt) 


The next proposition is Theorem 2 in Hawkes ( 1974| l —with a slight modification as the original statement refers to 
the probability generating functional whereas we prefer the nowadays more common Laplace functional notion; 


Proposition 7. Let N be a Hawkes process with immigration intensity rj > 0 and reproduction intensity h as in 
Proposition^ Then the Laplace functionalofN is 




,[/]=exp|77j ('Tf[/(f + 0]-l)df|, 


where T'/r is afunctional that is the unique solution to 

exp 




In both equalities, f denotes an arbitrary measurable, nonnegative function with compact support. 


3. Links between INAR(oo) and Hawkes processes 

In the following section, we first explain how discrete-time INAR(oo) processes can approximate a continuous-time 
Hawkes process. After the convergence theorem, we establish a number of properties of the approximating sequence. 
Finally, we collect some stmctural analogies of the two models. 


3.1. Preliminaries 

Let T„, n e N, and Y be random variables with values in some topological space. The sequence (Y„) converges weakly 
to Y if lim„^oo Ei/3(T„) = E^(T) for all nonnegative continuous bounded functions ip. We define weak convergence 
of point processes with respect to the vague topology Mp on see Section 2.1 


In this case, weak convergence 


of point processes is equivalent to convergence of their finite-dimensional distributions; see Daley and Vere-Jones 
P003| l, Theorem 11.1.VII. General weak convergence theory, as developed in the monograph Billingsley ( 1968| l, 
considers sequ ences in metric spaces. Therefore it is important to note that the vague topology is metrizable', see 


Resnick (1987 1 , Proposition 3.17. In other words, we may treat (Mp,Aip) as a metric space where necessary. A most 
helptul theorem in the weak-convergence context is the continuous mapping theorem-, see Theorem 5.1 in Billingsley 
(|1968|l. We apply it in the following form; 


• • -f 

Proposition 8. Let (Nf) and N be point processes such that Nn^ N, n oa. Furthermore, /ef / ; R ^ Rq 
be a bounded, measurable function with compact support and with a set of discontinuities Df e S such that 


P > oj = 0. Then, f f(t)N„(dt) f f(f)N{df)for n —» oo. 


3.2. The convergence theorem 

Next to the conditions on the reproduction intensity h from Definition]^ and Proposition]^ we introduce an additional 
assumption; we want h piecewise continuous. We say a function / ; R —» R is piecewise continuous if its set 
of discontinuities Df c R is finite and for all to ^ Df the limits lim,.^,-/(f) and lim,^f+/(f) exist and are finite. 
Combining all assumptions on h yields the following important technical 
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Lemma 1. Let /z : M ^ M>o be piecewise continuous function with h{t) - 0, t < 0, and J hdt < 1. Then there exist 
constants 6 > 0 and K < I such that, for any A e (0, 6), 

CO 

Aj^h(kA) <k < 1. (53) 

k=l 


In the sequel of the section, let 5 > 0 and < K < 1, A e (0, d), be as in the lemma above. We state the main 
mathematical result of our paper; 

Theorem 3. Let N be a Hawkes process with immigration intensity rj and reproduction intensity h as in Lemma 
For A 6 (0,(5), let be an INAR(oo) sequence with immigration parameter Aq and reproduction coefficients 

Ah(kA), k eN. From these sequences, we define a family of point processes by 

At(A)(A) ^ AeS, Ae(0,6). (54) 

kikAeA 


Then, we have that 


N' 


(A) 


N for A —> 0. 


Our proof uses the standard weak-convergence approach—as followed in the Hawkes context, e.g., by Bremaud and 


Massoulie (1996 1 . First, tightness of the approximating family is established. By Prohorov’s theorem, tightness yields 
weak subsequential limits for all subsequences. Then we show that all those potential weak subsequential limits 
have the same distribution as the Hawkes process. This will establish the result. An alternative approach would be 
convergence of Laplace functionals that are given by 

Proposition 9. For some A e (0, d), let be as in Theorem Let f be a nonnegative measurable function with 
compact support. Then the Laplace functional ofN^^^ evaluated at f is 


'T^,a,[/] = exp 





”)'^))neNo) ' 


■)| 


Flere, the function 'Pwi, operating on sequences (s„)„eNo ^ coo ([0, oo)) is a solution to 


exp I £ A/z(M)('T^^l,(("i+«)«eNo) - l) 


U=i 


Proof. Plugging in definitions, we obtain 


[/] = E exp |- £ f(t)M^^ (df)| = E exp I - 2 Xi'^^f(nA) |. 


By stationarity of (Xjf^^), we may assume without loss of generality that /(f) = 0, f < 0. We apply formulas ( [32) 
and ( |33) for the joint moment-generating function of the INAR((x)) process and the corresponding generic 

family process 


'I'iv(A) [/] - - /(O), -/(A), -f(2A ),...)) 1* exp I ao X (^(F/h(( " ) ' 

{ i(=7. ^ ^ « 0/ 


We set 'I'^ 4 )((s„)) M^f(A)^((-s„)), (s„) e coo([0, oo)). This establishes the lemma. □ 

The similarities between the formulas in Propositionj^above and the corresponding equations for the Hawkes process 
in Proposition 1^ are striking. Still, rather than establishing the convergence result from Theorem]^ via the Laplace 
functionals, we choose a more direct reasoning on the process level that contains useful information on the approxi¬ 
mating point process family as a by-product. Some of the properties that are necessary for this convergence proof are 
collected in the following lemmas; 
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( 55 ) 


Lemma 2. For any A 6 (0,5), let be a point process as in Theorem^ Then, for A 6 S, we have that 

A n {A:A : A: 6 Z) = 0 ^ N^^^{A) - 0, almost surely. 

For the expectation, we find that 


EN^^f{kA]) = A — ^^<5-^, keZ, (56) 

and, for a < b, 

EN^‘^\[a,b]) <{b-a + 26)-^ <oo. (57) 

\ — K 

Proof Claims ( |55] l and ( |56| ) directly follow from the definition of in ( |54l l together with Proposition and 
Lemma [T] For •nil, we find that the number of grid points in the interval [a,b] is less or equal [{b - a)/A] + 1. 
To get rid of the ceiling function, we observe that [(h - a)/A] + 1 < (b - a)/A + 2. Combining this with the facts that 
/^(A) < ^ A < 5 together with ( |55] l and ( |56] l yields inequality ( |57| ). □ 


For the second moments, we find 

Lemma 3. For approximating family of point processes from Theorem 


0 


we have that 


sup Var(A?<^^(A)) < oo, A 6 
Ae(0,<5) ^ 


Proof Let A e S;, be a bounded Borel set. Note that 


Var(N<'^^(A)) 


= Var 


’ z 

V«:/iA€A / 


^ ^ Cov(x^),4^>)= ^ ^ R^^\n 

m: mAeA n: nAeA m: mAeA n: nAeA 


m), 


where is the autocovariance function of the INAR(oo) sequence from Proposition From this proposition, we 
know that R^^fk) > 0, k e Z and ~ Applying these results yields 


Var(A<^^(A)) 


. z Z''™'”-”)^ Z 

m: mAcA «€Z m:mAeA ^ ^ 


<2 + 


sup A - inf A \ 

A j 


2t]A 

(i-Ky 


< (26 + sup A - inf A) 


2?7 

{i-Rr 


where .S' < 1 does not depend on the choice of A 6 (0, d); see Lemma[^ □ 

A family of random variables (Dig/ is uniformly integrable if limM^oo sup^g^E [l| 7 ,|>M|Fil] = 0- We obtain uniform 
integrability of the random variables in question as a corollary from Lemmaj^above: 

Lemma 4. Let approximating family of point processes from Theorem 

have that the family of random variables is uniformly integrable. 

A family of probability measures on is uniformly tight if, for all e > 0, there exists a compact 

set K c Mp such that P^'*[/r'^] < s, i e I. 

Lemma 5. The family of the probability measures ^ on (^Mp,cr(^Aipyj corresponding to the random point 

processes ^ is uniformly tight. 


0 


5 anii A 6 Sb. Then we 


18 











Proof. The claim follows with Proposition 11.1.VI. from Daley and Vere-Jones ( 2003) 1 if for all compact intervals 
[a, h] c M and for all e > 0 there exists w M < oo such that 


sup P 

Ae(0,<5) 


> M 


< e. 


The uniform boundedness of these probabilities is a consequence of Lemmaj^and Markov inequality: for any e > 0 
and a < b, let := (b - a + 26)ril(l - K), where 6 > 0 and .^ < 1 as in LemmaThen we have that 

EV(^>([a,h]) 




M, 


<{b- a + 25) 




= e, A e (0, 5). 


□ 


Proof. (Theorem^ As a consequence of Lemmathe family of probability measures that corresponds 

to the family of point processes relatively compact for weak convergence by Prohorov’s theorem; see 

Daley and Vere-Jones (^2003 i. Theorem A.2.4.1. So every sequence in respectively, 

weakly convergent subsequence. In particular, for any zero sequence in (0, 6), we can find a subsec 


, contains a 

Ae(0,5) 

subsequence (A„) such 

that converges weakly to some point process N*. If the distribution of N* does not depend on the initial choice 

of the subsequence, it follows that the original sequence converges weakly to A*; see Theorem 2.3. in |Billingsley| 
( 1968) 1. Reconsider the implicit defining-equation ( )52] i from Definition]^ By Proposition]^ we know that this 
equation determines the distribution of the solving process. So, for the proof of Theorem]^ it suffices to show that any 
subsequential limit candidate A* solves Furthermore, one can show that it suffices to prove ( ]52) i for A* e , 
where 03^ is a semiring of sets that generates the cr-algebra Pif ; see ( [50) l. A semiring is a class of sets yi such that 
for any pair A,B e Jl one has {i) A r\ B e and (ii) (A U B) \ (A n B) = U"^jA, for some n e N, (A,) c ^ and 
A, nAj = 0, i,j= We consider, for any a e M and any point process A, 


:=|{w e D : A((si,fi])(w) e Di,..., A((s<., fi])(w) e D*} : -oo < s,- < f,- < a, D; c No, k e N |. 


One can check that the set system 58^ is indeed a semiring. Summarizing the above, for the proof of Theorem]^ i 
suffices to establish 


(58) 
it 


E 


lA-A*((a,h]) 


= E 


O 

I 


U. A(s\N*)ds 


a<b,A* e 


(59) 


First, let us establish a discrete version of ([59)l for the approximating sequence: set A„ := A^^"^ for all A„ in the chosen 


subsequence. For a <b and A„ e 95”", we find that 
E 


U,, A„((a, b]) 

= E 

ia„ y 4""’ 



k:kA„E(a,h'i 



= E 

u. Z 

(A,.) 



k:kA„€(a,b] 

\ 


= E 

u, Z 

/ 

A„77 


k-.k/S.,je{a,b] 


l=l 


1=1 


AA„) 


« 6 N . 


The last step follows by the observation that the immigrations as well as the thinnings that contribute to 
kA„ > a, are independent of ,... Rewriting the inner sum of the last term as an integral with respect to the 

random measure A„, we obtain, for a < b, 






E 

lA,N„((a,b]) 

= E 

U,. 2^ A„lrj+ h(kA„ - s)N„(ds)j 




k:kA„£(a,b] ' f 


A„ e *B^", neN. 


(60) 


19 












































Note that here and throughout the proof the upper integration bounds in the Hawkes intensities do not require special 
attention due to the assumption h{Q) - 0 for reproduction intensities h in Definitionrespectively, Lemmafl] Now 
we show that ( |60l l converges to ( |59l ) corresponding to the Hawkes process. For both sides of equation ([hOlTthis is 
achieved in three steps; 

• First, we establish that the random variable in the expectation can be written as Q>{N„), where <1) ; {Mp, Aip) 

(K, S) denotes some measurable mapping with set of discontinuities D® c Mp. 

• Next, we show that P [N* e D®] = 0. 


• Finally, we prove that the random variables in question are uniformly integrable. 

By Proposition]^ the first two points together imply that <1>(A^„) ^ <1>(A^*). The additional uniform-integrability prop¬ 
erty yields that the corresponding expectations also converge; see Theorem 5.4 in|Billingsley|(|1968ll. 


Left-hand side of (@.- 
Consider the map 

d) : {Mp,Mp) ^ (K,S), m<r^ (61) 

see the definition of *8^ in ( |58] l for the notation. We claim that is vaguely continuous on Mp \ {m : m (D®) > 0), 
where D® ({a, b) U 6))- Indeed; the map m m{{a, b]) is vaguely continuous on Mp \ [m ; m{{a, b}) > 0) 

and, for i - the maps m i-^ m{{si,ti]}) are vaguely continuous on Mp \ {m : m({i,■,?,■)) > 0). The map 

No 3 (h,...,h) is trivially continuous, so that m i-3 l\m((suh])eDu...M(sk,h])eDt] is continuous on 

Mp \ ti). From Proposition]^ we have that ^ if P [A^* (D®) >0] =0. Because D® is finite, it 

suffices to show that P [A^*({f)) > 0] = 0 for any f 6 K . 

¥[N*({t}) > 0] = El^.(|„)>o < EN*({t}) EiV„({f)) f* A-^, f e K,A 6 (0,A). 

n^oc ( I - K) 

W 

So P [N*({f)) > 0] = 0 and, consequently, P [A^*(D®) > 0] must also be zero. This establishes 0(N„) ^ <1>(A^*), respec¬ 
tively, 1.4,,A^„(a, b) lA'N*(a, b). From Lemma]^ we know that (Nn{a, b)) is uniformly integrable, so {lA„Nn{a, b)) is 
also uniformly integrable. Combining weak convergence and uniform integrability yields convergence of expectations 


lim E 

n—>oo 


^A,Nniia, b]) 


= E 


lA>N*i(a,b]) 


We have established the convergence of the left-hand side of (|60ll to the left-hand side of ([59]l. 


Right-hand side of @.- 

Note that the right-hand side of (|60ll converges to the right-hand side of (|59]) if 



kA„ 

b 

t 

Z 

r h(kAn - s)Nn(ds) 

r E 

1a* f h(t-s)N*(ds) 

ke(a,h] 

xJ 

—oo 

vi 

a 

J 

—oo 


(62) 


As a first step for establishing (|62|), note that, for all choices of M with -M < a, and, for t 6 [a, b]. 


f 

J-M 


f 


h(t-s)K(ds) ^ U. h(t - s)N*(ds), n ^ oo. 


(63) 


This is due to a continuous-mapping argument similar to the one we have used for the left-hand side of ( |60l l. We 
establish that the variances of the random variables h(t - s)N„{ds), n 6 No, are uniformly bounded; 


Var 


rM/A„l rM/A„l 

hit-s)N„ids)<VaT ^ Yj Z h(lA„)h(mA„)Cov{x[f\x[t^). 

/=! Z=1 m=l 
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At this point, we write for the autocovariance function of the INAR(oo) process 
yields 


Applying Proposition 


Var 




h(t - s)Nn(^s) 


< ^ /r(lA„)sup/! ^ - m\) 

/-I m-l 

rM/A„] oo 

< /i(/A„)suph 


/=1 


(l) 


Irj^n 


/z(/Am) sup/z- 


< (suph) 


2 M + 1 2A„?7 

A„ (i_/:(a„))3 


Ipl / ,^2 2(M+ 1)77 

S (suph) -^ < Cm < °°, 

{\-Kf 


(64) 


where cm is a constant independent of n, respectively, A„. We may conclude that the random variables 1^,, hit - 
s)NniAs), n e N, are uniformly integrable. Weak convergence together with uniform integrability yields convergence 
of expectations. We have established that, for M with -M < a. 


lim E 

1a„ r hit - s)N„ids) 

= E 

U. r h{t-s)N\ds) 

n^oo 

J-M 


J-M 


(65) 


For the proof of ( |62l i, we consider a truncated part of h and its remaining tail separately. For the truncated part, we 
use ( |65l i; for the tail part, we use the integrability condition J/z df < 1: for any M > -a, we have 


^ A„E 

kA„ 

r h(kAn - s)N„(ds) 

b 

- [e 

1 

* 

dt 

ke(a,b] 

J 

—oo 

J 

a 

—OO 





kA„ 

b 

t 


< 

Xi a„e 

1a„ I h(kA„ - s)Nn{ds) 

- Te 

f h(t-s)N*(ds) 

dt 


ke(a,b] 

J 

L -M J 

J 

a 

J 

i ~M J 





ke{a,h\ 


I 


h(kA„ - s)Nn(ds) 


h 

-M 1 

1 E 

Ia’ I hit-s)N*ids) 

a 

%J 

— OO 


( 66 ) 


Let e > 0. We show that we can hnd and € N such that each of the three summands in ( |66l ) is bounded by e/3 
for n > As. First, consider the integrand of the last summand in ( |66| l. By arguing similarly to the m((a,b]) part in ( |6T| i, 
we hnd that E A*(df)/df < rjUl - K). So we can choose M**' > 0 so large that, for t € [a, b]. 


E 


£ 


hit - s)N*ids) 


< E 


£ 


hit - s)N*ids) < 


^ r 

I-KJm 


his)ds < 




3ib - a) 


The summands of the second term in ( |66l l can be bounded by e/i3[b - a]) in a similar and even more direct way— 
uniformly over n and possibly with respect to another Mf'' > 0. We set Mg := max {M^', So taking the integral 
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over the interval {a,b\ of the last summand, respectively, the Riemann sum of the second summand in (|66]l, yields 


/ 4 "X; 


huit - s)N'‘{ds) 


df < -, 
3 


(67) 


respectively. 


A„E 

k\ kis.„e{a,b] 




hM(kA„ - i)A^*(di) 



( 68 ) 


For the first term in (|66|, denote 


EM E 


f 


h(t - s)Nnids) 


and £*(f) := E 


f 


h(t-s)N*(ds) 


From ( |63| l, we already know that, for any choice of Mg, lim„^oo IE„(t) - E*{t)\ = 0, f 6 (a,b]. However, the conver- 

X I) 

E*(s)ds is nontrivial as the functions E„ are 

themselves part of the sequence. We write 



f E*(t)dt 

< 

^ nb 

^ KEnikAn)- 1 EMAt 

+ 

r £'„(f)df- 

f E*(t)dt 

k:kAe{a,b] 

Ja 


k: k^e{a,b] 


*Ja 

Ja 


The second absolute difference in ( |69l l converges to zero because of dominated convergence of {E„). Indeed, for 
t e [a, b]. 


E 


f 


h{t - s)Nn{<^s) 


X h ^ 

h(t - s)N„(ds) < suph ^ 

Me L- b\ 


Exf"^ < sup/!^^' 


■b^■n 


k:kA„e(-M^,b] 


1-K 


(70) 


Note that sup h < oo follows from the piecewise-continuity assumption. In view of the upper bound ( |70l l, we apply 
the dominated convergence theorem and choose 6 N so large that, for the second absolute difference in ( |69l l, we 
have 



En(t)dt - 


f 


£*(f)df 



n > 


(71) 


For the first absolute difference in ( |69l l, we assume that, without loss of generality, the piecewise continuous function 
h is uniformly continuous on (0, oo). Otherwise, we note that any piecewise continuous function on K that is vanishing 
at infinity is uniformly continuous on each of its continuous pieces and do the following calculation once for every 
uniformly continuous piece of h. Uniform continuity gives us a constant 6h > 0, so small, that, for any to > 0, 


\t - fol < 4 A f > 0 

( 2 ) 

Now, choose so large that 


\h(t) - h(to)\ < 


£(1 - K) 

I2ri(b - a + 6){Me + b + 6) 


(72) 


A„ < min 



£(l-g) 1 

12r](b - a + 6) sup h J 


for n > 


(73) 
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Here again, 6 and K are the constants from Lemma[^ Let a < s < t < b with t - s < A„(< 6h), then 


\En(t) - En(s)\ = 




kA„e{-M£,t) 


kA,M-Ms,s] 


^ |h(f - kA„) - h{s - kA„)\ + ^ h(t - kA„) 

V kAn G ( - Me , 5 ] kA,i£{s,t) 

Mp + 5' + A„ 


EX 


(A,.) 


e{l-K) 


A„ \2ri(b — a + 6)(Ms + b + 6) 
S , T] sup h 


+ sup h 


y 

A„?7 


1 - 


12(h — a + 6) 
s 


+ A„ 


1 -X 
s 


(74) 


12(h — a + 6) I2(b — a + 6) 6(b — a + d) 

Summarizing the above calculation, we have established the existence of an 6 N such that, for all n > Nf \ we 
have \En{t) - £’„(i)| < sl{6(b - a + 6)) whenever |f - i| < A„ and s,t e [a, b]. For the first absolute difference in i 
we therefore get 


^ nh 

^ A„E„(kA„)- I En 

A AT 


(f)df 


k: A:A„G(fl,/?] 


2 X 

_ ni 

^ 2 I 

k:kA„e(a.b] 

{b - a + A„) ^ 


</t+l)A„ 

k:k^(a.b] ^^A,, 

'(*+l)A„ 


\En{kAn) - En{t)\ df 


-df 


6(h — a + 5) 
e 


6{b — a + 6) 


n > N?\ 


Set Ne max|Xg*\ Combining ( |7T| i and ( |75] l, we get that 


ph 

2 A„£„(0- E\t)At 


k\kAe{a,h\ 


< -, n> Ne- 

3 


(75) 


(76) 


Combining (|67|l, (|68|l and (|76|), shows that (|66ll is smaller than the given e, for n > Ne and M Me, i.e.. 


X A 

ft. 

o< 


h{kA„ - s)Nn{ds) 


k:kAi,e{a,b] 


fEk. r 

Ua [_ \J—00 


h(t - s)N*(ds) 


df. 


We have established that the right-hand side of ( |60l l also converges to the right-hand side of ( |59l l. With the result from 
Proposition]^ on the uniqueness property of (|5^, we find that every subsequential limit N* has the same distribution 
as the Hawkes process N. We may then conclude that, for A —» 0, the approximating sequence of point processes 
^^(A)^ converges weakly to the Hawkes process N. □ 


3.3. Structural analogies 

Besides the formal convergence result from Theorem]^ we point out a number of more general, partly obvious 
structural parallels between the Hawkes and the INAR(oo) model. The branching structure of the INAR(oo) model 
described after Theorem[T]is the same as the branching structure of the Hawkes process described after Proposition]^ 
This similar underlying structure yields analogous equations for the moment-generating function of the INAR(oo) and 
the Laplace functional of the Hawkes process; see Theorem]^and Proposition]?] Consequently, we can expect similar 
distributional properties. In the following statements, we compare the models more directly by considering a specific 
Hawkes process N together with its approximating family of INAR(oo) sequences, , A e (0, 6), obtained from 

Theorem ]3 
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1. The defining equations (|^ and ( |59l l have similar interpretations: taking expectations conditional on 'H^ 
cr : k < n - l) on both sides of Q yields 


■x<*> 


n-\ 


— rj + ^ h{{n — k)A)X^i^^\ n e Z, 


k=—oo 


which is similar to the local version of ( |59| l 

E [Nidt)\9{^] 


dt 


T 

I 


h(t - s)N(ds), t 6 ' 


2. The stability criteria from Theorems and [^correspond: for the time series case, Ah(Ak) < 1 is a suffi 
cient existence condition which in the Hawkes case becomes h(t)dt < 1. 


Bremaud and Massoulie 


(20011 

establishes the existence of a nontrivial Hawkes process, where the total weight of the reproduction intensity 
equals 1 and the immigration intensity is zero. The analogous statement for INAR(oo) sequences can be derived 
in a similar way. 

3. From the correspondence of the generating functions, we know that the moments of the INAR(oo) and the 
Hawkes process must be similar. The first and second moments of both model classes can be calculated and 
compared explicitly. The first moments are 

EX^'^' n EA(df) 77 

- respectively, - 


1 - ’ 


df 


1-K 




., for 




A2 

i ^ J 


respectively, for 


we find at the origin 


respectively. 


r(f): 


E [dAodA,] / E N(dt) r 


(df)2 


I df ; ’ 


e Z, 


t € 




1 


A1 - 




^ Ah(kA)R(k), 


f=i 


r(0) = 


1 

dtl-K 


OO 

/ 

0 + 


h(s)r(s)ds. 


Implicit equations of Yule-Walker type are valid in both cases: 

OO 

R^^\n) = Yj h(Ak)R^^\\n\ - k)A, n + 

respectively. 


r=i 


r(f) 


OO 

-f 


h{s)r{\t\ - s)ds, f 0. 


(77) 


(78) 


(79) 


(80) 


Equations ( |77] i and ( [791 ) are standard facts given the representation of the INAR(oo) sequence as a standard 
AR(cx)) process in Proposition [T] ( [781 ) and ( [80l l are derived in Hawkes ( 1971b|l. 
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3.4. The choice of the counting sequence distribution 

As a last remark, we again refer to the choice of the counting sequence distribution in Definition [T] An obvious 
alternative to the Poisson distribution would have been the Bernoulli distribution; see the discussion after Dehnition]^ 
With the Bernoulli choice, each individual would have not more than one offspring at each future point in time 
instead of potentially unboundedly many. We want to indicate that in the limit (in the sense of Theorem where all 
reproduction coefficients go to zero) this option would yield the same result: for A e (0,1) and {af) c [0,1] such that 
2 Q'/t < 1, let Pois(AaA:), k € N, and Bernoulli(Aai), k 6 N. Then one can easily show that 


lim 

A-^0+ 


p[4^Uo] 

pjifUo] 


lim 


pjffUi] 


= 1 , 


k eN. 


So when A is very small, the offspring distribution candidates, Poisson and Bernoulli, become very similar. Roughly 
speaking, the limiting procedure in Theorem|^is nothing else than a (complicated) superposition of limits of the form 
2^1 For these kinds of sums, we have that 


lim 

A^0+ 


p 

2 ^ « 
/t=i 

p 

Ilf-n 
>=1 


= 1 , 


n e No. 


If a*: > 0 infinitely often, then—^by the Poisson limit theorem—the two considered probabilities are even equal for 
all A 6 [0,1]. In view of the above, one can expect that Theoremj^with the Bernoulli distribution as a starting point 
would yield the same limit as the Poisson distribution, namely the Hawkes process. 


4. Conclusion 

The mathematical formulation of the correspondence between INAR and Hawkes processes in Theoremj^has the fol¬ 
lowing heuristic interpretation relevant for practical applications: let be the approximating family of INAR(cx))- 
based point processes for a Hawkes process JV as in Theorem]^ Then for A > 0 (small), the hnite-dimensional 
distributions of are approximately equal to the hnite-dimensional distributions of N. In particular, we hnd, for 
n 6 N, 


(a((0. A]), a((A, 2A]), ..., N[i(n - 1)A, nA])) ^ A]), ^'^^((A, 2A]),..., N^^\i(n - 1 )A, «A]) j 

= (81) 

where is the INAR(oo) sequence that the point process is based on; see Theorem 0 So is an 

approximative model for the bin-count sequences of the considered Hawkes process N. This point of view can be 
very fertile. For example, it leads to a nonparametric estimation procedure for the Hawkes process. Instead of htting a 
Hawkes process directly, one hts the corresponding INAR(oo) model from Theoremj^on the bin-counts for some small 
A > 0. Kirchner ( 2015a| l gives a detailed discussion of this estimation procedure including asymptotic properties, bias 
issues and techniques for an optimal bin-size choice A. Also note that the Hawkes bin-count sequence view in ( [8T] i 
on the INAR(oo) model is another argument for the choice of the Poisson instead of the Bernoulli distribution for 
the counting sequences: clearly a Hawkes event can have potentially more than one direct offspring event in a future 
time-interval. 

For any No-valued time series model, one can construct a point process model the way it is done in Theorem]^ 
So, studying integer-valued time series can be inspiring for developing and understanding point process models. For 
example, one might want to consider the corresponding point process of an integer-valued autoregressive moving- 
average (INARMA) time series. For INARMA time series, a moving-average part is added to the autoregressive part 
in the dehning difference equations (|^; see |Fokianos and Kedem j20O. In fact, the resulting point process is nothing 
else but the “dynamic contagion process” as proposed in Zhao^^l2|l. Also for integer-valued time series theory, it 
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is inspiring to translate point process models into the discrete-time setup. For example, one might want to translate 
the generalizing results on selfexciting point processes in Bremaud and Massoulie (1996 1 to the INAR context. Here, 
the selfexcitement of the point process is modeled not as an affine but as a general Lipschitz function of the past of 
the process. The analogous generalization in time series theory is “nonlinear Poisson autoregression” as studied in 
Fokianos and Tj0stheim|(TO12|l for the case p = 1. In the latter paper, the authors also find a Lipschitz condition for 


the transfer function. Yet another idea might be to consider marked INAR sequences in analogy to marked Hawkes 
processes; see Liniger| ( f2009| l. 

One can expect that the results of our paper also hold in a multivariate setup—with the obvious modifications. 
However, the notation would become even more tedious, so that we have decided to concentrate on the univariate 
case. In view of the many INAR/Hawkes-correspondences presented in this paper we conclude: INAR(oo) sequences 
are discrete-time versions of Hawkes processes and, vice versa, Hawkes processes are continuous-time versions of 
INAR(cx)) sequences. 


Acknowledgements 

M.K. takes pleasure in thanking Thomas Mikosch for his most valuable comments on an earlier version of the pa¬ 
per and Paul Embrechts for guiding him to the topic of Hawkes processes and its applications to quantitative risk 
management. The author acknowledges financial support from RiskLab at the ETH Zurich and the Swiss Einance 
Institute. Eurthermore, M.K. thanks Isabel Marquez da Silva for sharing her expertise on integer-valued models and 
Rita Kirchner as well as Anne MacKay for help with the editing. 

References 

References 

Al-Osh, M., Alzaid, A., 1987. First order integer valued autoregressive process. Journal of Time Series Analysis 8, 261-275. 

Billingsley, R, 1968. Convergence of Probability Measures. John Wiley and Sons. 

Box, G., Jenkins, G., 1970. Time Series Analysis - Forecasting and Control. Holden Day, San Francisco. 

Bremaud, P., Massoulie, L., 1996. Stability of nonlinear Hawkes processes. The Annals of Probability 24 (3), 1563-1588. 

Bremaud, P, Massoulie, L., 2001. Hawkes branching processes without ancestors. Journal of Applied Probability 38, 122-135. 

Brockwell, P, Davis, R., 1991. Time Series: Theory and Applications, 2nd Edition. Springer. 

da Silva, I. M., 2005. Contributions to the Analysis of Discrete-Valued Time Series. PhD thesis, Depaitamento de Matematica Aplicada Faculdade 
de Ciencias da Universidade do Porto. 

Daley, D., Vere-Jones, D., 2003. An Introduction to the Theory of Point Processes, second edition Edition. Vol. I and II. Springer. 

Errais, E., Gieseke, K., Goldberg, L., 2010. Affine point processes and portfolio credit risk. Society for Industrial and Applied Mathematics: Journal 
on Financial Mathematics 1, 642-665. 

Fokianos, K., Kedem, B., 2012. Regression Models for Time Series Analysis. Wiley. 

Fokianos, K., Tj0stheim, D., 2012. Nonlinear Poisson autoregression. Annals of the Institute of Statistical Mathematics 64, 1205-1225. 

Hawkes, A., 1971a. Point spectra of some mutually-exciting point processes. Journal of the Royal Statistical Society: Series B 33, 438-443. 
Hawkes, A., 1971b. Spectra of some self-exciting and mutually-exciting point processes. Biometrika 58, 83-90. 

Hawkes, A., 1974. A cluster representation of a self-exciting point process. Journal of Applied Probability 11, 493-503. 

Kallenberg, O., 1983. Random Measures, 3rd Edition. Akademie-Verlag. 

Kirchner, M., 2015a. An estimation procedure for the Hawkes process. Working Paper, ETH Zurich. 

Kirchner, M., 2015b. Hawkes and INAR(oo) processes. Working Paper, ETH Zurich. 

Latour, A., 1997. The multivariate GINAR(/?) process. Advances in Applied Probability 29, 228-248. 

Li, J.-G., Yuan, Y, 1991. The integer-valued autoregressive (INAR(/?)) model. Journal of Time Series Analysis 12 (2), 129-142. 

Liniger, T., 2009. Multivariate Hawkes Processes,. PhD thesis, ETH Zurich. 

McKenzie, E., 1985. Some simple models for discrete variate time series. Water Resources Bulletin 21 (4), 654—650. 

Nakayama, M., Shahabuddin, P, Sigman, K., 2004. On finite exponential moments for branching processes and busy periods for queues. Journal 
of Applied Probability 41, 273-280. 

Ogata, Y, 1988. Statistical models for earthquake occurences and residual analysis for point processes. Journal of the American Statistical Associ¬ 
ation 83 (401), 9-27. 

Resnick, S., 1987. Extreme Values, Regular Variation, and Point Processes. Springer. 

Seneta, E., 1969. Functional equations and the Galton-Watson process. Advances in Applied Probability 1, 1^2. 

Steutel, F, van Ham, K., 1979. Discrete analogues of self-decomposability and stability. The Annals of Probability 7 (5), 893-899. 

Weiss, C., 2008. Thinning operations for modeling time series of counts—a survey. Journal of the American Statistical Association 92, 319-341. 
Whittle, P, 1951. Hypothesis Testing in Time Series Analysis. PhD thesis. University of Uppsala. 

Wiener, N., 1932. Tauberian theorems. Annals of Mathematics 33 (1), 1-100. 

Zhao, H., 2012. A Dynamic Contagion Process for Modelling Contagion Risk in Finance and Insurance. PhD thesis, The London School of 
Economics and Political Science. 


26 












